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ABSTRACT 

^rWe  discuss  a  system  for  translating  regular  expressions  into  logic  equations  or  PLA’s,  with  particular 
attention  to  how  we  can  obtain  both  the  benefits  of  regular  expressions  and  state  machines  as  input 
languages.  An  extended  example  of  the  method  is  given,  and  the  results  of  our  approach  is  compared 
with  hand  design;  in  this  example  we  use  less  than  twice  the  area  of  a  hand-designed,  machine 
optimized  PLA.  x~~^" 


I.  The  Regular  Expression  Compiler 

A  collection  of  routines  have  been  written  by  II.  Trickey  and  J.  Ullman  to  translate  regular  expressions  into 
circuits.  At  present,  we  first  compile  regular  expressions  into  a  language  that  describes  nondeterministic 
finite  automata  (NFA’s).  These  NFA’s  arc  then  compiled  into  either  PLA’s  or  S.  C.  Johnson’s  Igen  logic 
language. 

A  description  of  the  regular  expression  language  appears  in  [TU].  The  language  is  quite  standard,  with 
perhaps  the  following  exceptions. 

1.  Input  symbols  are  not  “disjoint,”  in  the  sense  that  at  any  time  only  one  can  be  seen  on  the  input. 
Rather,  >nput  symbols  are  defined  in  terms  of  some  set  of  wires  being  on  or  off.  Since  not  all  wires  must 
be  specified  for  each  symbol,  there  is  the  possibility  that  two  or  more  symbols  arc  on  at  a  time.  This 
has  the  consequence  that  apparently  deterministic  processes  can  in  fact  have  nondcterininisin  in  them. 

2.  Output  signals  are  represented  by  ordinary-looking  symbols  in  regular  expressions.  When  the  input  is 
such  that  an  output  symbol  is  readied  in  the  expression,  we  emit  that  signal,  and  proceed  to  recognize 
any  continuation  of  the  expression  that  the  input  allows  us  to  recognize. 

Example  1:  In  Fig.  1  we  sec  an  input  to  the  regular  expression  compiler  that  forms  a  running  example 
for  this  report.  Without  dealing  now  with  the  issue  of  what  this  program  docs,  let  us  observe  a  few  salient 
features.  The  first  line  says  that  there  arc  seven  input  wires,  x[l], . . .  ,x[7).  Next  come  the  definitions  of  the 
input  symbols.  For  example,  we  see  signal  inO  on  the  input  whenever  the  first  wire  is  on  and  the  second  off. 
Note,  for  example,  that  we  could  sec  symbol  *n0  and  also  acka,  if  the  first  three  wires  were  1,  0,  and  1. 
f  Work  supported  l>y  DAIM’A  contract  MDA  D03  80-C-0107. 


Following  this  come  the  declaration  of  output  signals,  and  then  three  subexpressions,  aomein,  which  is 
recognized  when  either  tnO  or  ini  is  seen,  waitin  which  is  recognized  when  neither  input  is  seen,  or  both 
wires  x[l]  and  x[2)  are  on  simultaneously  (which  represents  a  “bad  input,”  the  symbol  badin),  and  ailbutOl, 
which  stands  for  the  union  of  all  signals  but  inO  and  ini.  After  the  declaration  portion  is  a  semicolon  and 
the  expression  itself. 

As  an  example  of  how  the  expression  is  to  be  interpreted,  consider  the  seventh  line  of  the  expression, 
beginning  stateia •  •  ■■  It  says  that  if  we  get  a  signal  telling  us  we  are  in  state  a,  and  then  receive  any  number 
of  symbol  noacka  ( noacka *  means  “any  number  of  noacka' s”),  we  emit  the  signal  OUT  A.  We  regard  x[3] 
as  a  wire  that  “acknowledges”  the  fact  that  signal  OUTA  was  received,  so  symbol  noacka,  defined  by  z[3] 
being  off,  is  seen  until  the  acka  symbol,  x[3]  =  1,  is  seen.  In  effect,  we  emit  the  output  signal  OUTA  until 
it  is  acknowledged. 

When  OUTA  is  acknowledged  by  acka  appearing  on  the  input,  the  process  of  recognizing  the  expression 
proceeds  to  waitin* ,  which  is  recognized  for  as  long  as  the  first  two  wires  remain  at  0,  or  both  become  l 
simultaneously  (a  bad  input).  Then,  when  *n0  or  tnl  is  seen,  a  signal  to  change  to  state  b  or  c  is  made.  If 
any  of  the  symbols  represented  by  allbut 01  is  received  after  the  acka,  an  error  is  declared.  D 

II.  Combining  States  and  Expressions 

The  motivation  for  using  regular  expressions  as  a  source  language  is  twofold.  First,  they  are  a  succinct  and 
nonprocedural  description  of  a  large  class  of  sequential  processes.  Thus  they  can  provide  some  simplification 
in  the  design  process  for  the  right  problem.  Second,  being  structured  descriptions  of  patterns,  they  are 
appropriate  for  proofs  of  correctness,  and  even  if  a  formal  proof  is  not  attempted,  they  provide  useful 
intuition  that  helps  the  reader  convince  himself  of  the  correctness  of  the  expression.  In  comparison,  transition 
functions  for  automata  arc  analogous  to  programs  with  goto’s;  they  are  inherently  hard  to  understand  and 
verify,  either  formally  or  intuitively. 

On  the  other  side  of  the  coin,  there  are  distinct  advantages  to  process  descriptions  involving  states. 
Often,  it  is  natural  to  view  a  process  as  being  in  one  of  several  states;  for  example,  counting  is  especially 
easy  when  you  have  states  available  and  very  hard  to  do  with  regular  expressions.  It  is  the  purpose  of  this 
report  to  dcscibc  a  simple  modification  to  the  regular  expression  compiler  that  allows  us,  in  effect,  to  declare 
states  and  then  define  transitions  among  states  in  regular  expression  terms.  As  a  result,  we  get  the  best 
of  both  worlds;  states  arc  available  when  they  arc  more  succinct  than  regular  expressions  or  when  they 
help  us  organize  our  design,  and  regular  expressions  are  available  when  patterns  of  symbols  arc  useful  as  a 
description  of  events. 
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line  x[7j 
■ymbol  inO(x(l]  -x[2j) 
ml(x(2)  -x(Il) 
badin(x|l]  x[2]) 
acka(x[3]) 
ackb(x[4]) 
aekc(x[5}) 
atateia(x(6]  x[7]) 
atatcib(x[6)  -x(7j) 
statcie(-x[6]  x[7]) 
sl*rl(-x{6]  *x[7|) 
noin(-x(l]  -x[2j) 
noacka(-x[3j) 
noaekb(-x(4]) 
noackc(-x[S]) 
output  OUTA 
OUTB 
OUTC 
atateoa 
atateob 
atateoe 
ERROR  * 

subexp  somein=inO  +  ini  4-  badin 

aubcxp  waitin=noin  +  badin 

subexp  allbut01=acka  +  aekb  +  aeke  +  badin 

I 

atari  waitin'  ( 

allbutOl  ERROR  + 
inO  atateoa  + 
ini  atateob 
) 

+ 

atateia  noacka*  OUTA  ( 

(ackb+acke+aomein)  ERROR  +• 
aeka  waitin'  ( 

allbutOl  ERROR  + 
inO  atateob  + 
ini  atateoe 
) 

) 

+ 

atateib  noackb*  OUTB  ( 

(acka+ackc+aomeln)  ERROR  4* 
aekb  waitin'  ( 

allbutOl  ERROR  + 

InO  atateoe  + 
ini  atateoa 
) 

) 

+ 

atateie  noacke*  OUTC  ( 

(aeka+aekb+somein)  ERROR  + 
aeke  wailin'  ( 

allbutOl  ERROR  + 
inO  atateoa  + 
ini  atateob 
) 

) 


Fig.  1.  Input  to  regular  expression  compiler. 
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To  introduce  stales  into  the  regular  expression  language,  we  make  the  following  modifications. 

1.  The  names  stateiX  tor  any  X  are  input  symbols  that  represent  the  fact  that  we  have  just  entered  state 
X.  Symbol  start  serves  as  the  initial  state.  To  indicate  that  these  states  are  disjoint,  i.e.,  we  can  be 
in  only  one  of  them  at  a  time,  we  can  use  imaginary  wires,  such  as  x[6]  and  x[7]  in  Fig.  1,  to  make  it 
appear  to  the  compiler  that  at  most  one  of  these  input  symbols  can  be  present  on  the  input  at  any  time. 
Of  course,  if  the  states  were  not  disjoint  in  this  sense,  we  could  express  the  legal  subsets  by  another 
combination  of  dummy  wires. 

2.  Output  symbols  stateoX  for  any  X  are  used  as  goto’s.  If  we  emit  the  symbol  statcoX,  we  shall  in  clfect 
turn  on  the  input  symbol  stateiX  and  enter  state  X. 

The  complete  regular  expression  cousists  of  the  sum  of  expressions  that  begin  start  and  stateiX  for  the 
various  X’s.  The  portion  of  the  regular  expression  associated  with  each  state  is  recognized,  if  possible,  each 
time  we  enter  that  state,  and  we  make  whatever  outputs  the  regular  expression  tells  us  to  make  in  response 
to  what  inputs  we  see. 

After  the  regular  expression  compiler  converts  the  expression  into  a  nondcterministic  finite  automaton, 
an  edit  script  is  used  to  identify  the  input  symbol  stateiX  with  the  output  symbol  stateoX  and  make  certain 
other  changes  so  things  work  properly. 

Example  2:  A  case  in  point  is  the  problem  to  which  the  regular  expression  program  of  Fig.  1  is  a  solution. 
This  program  implements  the  transmitter  from  [AUY]  that  sends  bits  reliably  over  a  channel  that  has  a  high 
probability  of  losing  bits,  but  does  not  change  0’s  into  l’s  or  vice-versa.  This  view  of  a  channel  is  plausible  if 
we  assume  that  any  noise  or  other  error  causes  the  system  to  fail  to  detect  a  bit.  This  strategy,  of  assuming 
no  signal  whenever  something  goes  wrong,  is  modeled  after  the  Datakit  protocol  [F]. 

The  general  idea  is  that  when  the  transmitter  is  given  a  bit  to  send,  it  sends  one  of  three  signals 
OUTA,  OUTD,  or  OUTC,  chosen  by  a  method  to  be  described.  It  keeps  sending  the  signal  until  it  receives 
an  acknowledgement  of  the  signal  sent.  Then,  it  stops  sending  the  signal  until  the  next  input,  0  or  1,  is 
received,  whereupon  it  sends  the  next  signal  (in  the  sense  that  c  follows  6,  which  follows  o,  which  follows  c) 
if  0  is  input,  and  it  sends  the  previous  signal  in  this  cyclic  order  if  1  is  input. 

I 

The  purpose  of  this  arrangement  is  so  that  whenever  the  transmitter  sends  a  new  input,  it  changes  the 
signal  sent;  that  change  serves  to  acknowledge  the  acknowledgement.  If  we  did  not  always  make  a  signal 
change,  the  receiver  could  not  tell,  say  upon  receiving  two  0’s,  whether  these  were  two  different  inputs,  or 
the  acknowledgement  of  the  first  had  been  lost,  and  the  second  0  was  a  retransmission  of  the  first. 

Another  way  to  look  at  the  signal  selection  algorithm,  is  that  we  count  one  for  an  input  0  and  two  for 
an  input  I,  and  transmit  OUTA,  OUTU,  or  OUTC  depending  on  whether  the  sum  of  inputs  received  so 


far  is  congruent  to  1,  2,  or  0,  modulo  3.  Counting,  even  counting  modulo  3,  is  very  difficult  to  express  in 
the  regular  expression  language.  Thus  it  is  natural  to  introduce  three  states,  a,  6,  and  c,  that  are  entered 
whenever  we  receive  an  input  that  makes  this  running  modular  sum  1,  2,  or  0,  respectively. 

We  already  discussed  briefly  in  Example  1  what  happens  in  one  of  these  states,  say  a.  After  receiving  the 
atateia  signal  to  say  we  have  entered  state  a,  we  emit  OUT  A  for  as  long  as  the  input  matches  noacka*,  that 
is,  the  acka  acknowledgement  is  not  received.  A  sequence  of  noacka' s  can  be  followed  by  either  of  two  events 
that  cause  special  outputs.  First,  the  acka  signal  can  be  received,  and  then,  after  any  sequence  of  waitin's, 
i.e.,  no  input,  an  tnO  or  tnl  triggers  a  signal  that  causes  a  jump  to  another  state,  b  or  c,  respectively.  After 
receiving  acka,  any  input  but  in 0  or  tnl  causes  an  error  signal.  Note  that  allbutOl  and  waitin  can  be  seen 
at  the  same  time,  so  we  can  continue  waiting  for  a  good  input  even  while  signaling  when  errors  occur. 

Now  let  us  return  to  the  point  in  the  expression  where  we  are  recognizing  noacka*  and  waiting  for  acka. 
At  the  same  time  we  are  waiting  for  acka,  if  we  receive  ackb,  ackc,  or  aomein,  we  have  an  error  condition; 
in  the  first  two  cases,  the  wrong  acknowledgement  was  received,  in  the  last,  we  received  either  a  bad  input, 
or  a  good  input  before  we  are  ready  to  transmit  it.  In  this  case,  we  emit  the  output  signal  ERROR.  Note 
that  all  of  these  error  conditions  arc  seen  on  the  input  at  the  same  time  noacka  is  seen,  so  emitting  ERROR 
docs  not  prevent  us  from  continuing  to  see  noacka*  and  eventually  to  sec  acka  and  another  input.  However, 
inputs  received  erroneously  do  not  cause  a  change  of  state,  because  we  cannot  reach  a  term  like  «'n0  atateob 
in  the  regular  expression  until  after  the  acka  has  been  received. 

The  portions  of  the  expression  following  atateib  and  atateic  are  analogous  to  what  we  have  described. 
The  portion  following  start  differs  only  in  that  we  are  not  . waiting  for  an  acknowledgement,  and  if  any  is 
received  it  is  an  error. 

The  result  of  compiling  Fig.  1  is  shown  in  Fig.  2.  This  figure  illustrates  the  NFA  language  used.  Each 
type  of  statement  begins  with  a  unique  letter.  For  example,  D  is  a  declaration  of  an  input  symbol,  much 
like  in  the  regular  expression  compiler.  However,  note  that  the  input  symbol  atateiX  and  the  output  symbol 
statcoX  have  both  become  atateX,  and  this  input  symbol  is  declared  (in  lines  6-8,  c.g.)  to  be  present  when 
the  wire  of  the  same  name  is  on;  that  wire  is  the  corresponding  output  signal. 

The  letter  N  indicates  the  name  of  the  NFA,  and  F  indicates  the  name  of  the  final  signal,  if  any  (there 
is  none  in  Fig.  2),  and  the  slates  that  cause  the  final  signal  to  be  emitted.  Letter  I  introduces  the  name  of 
the  initial  signal,  init  in  this  case,  and  a  list  of  the  initial  slates,  st2,  at 3,  and  so  on. 

A  state  is  declared  by  the  letter  S,  followed  by  the  state  name,  and  the  input  symbol  that  it  recognizes. 
The  NFA  langaugc  is  restricted  in  that  each  state  recognizes  only  one  input  symbol.  However,  this  restriction 
is  not  bothersome  for  NFA’s  that  arc  output  by  the  regular  expression  compiler,  and  in  general,  \vc  can  create 


D  inO  (  inO-inl) 

0  ini  ( -inO  ini) 

D  badin  (  inO  ini) 

D  acka  (  aeka) 

D  aekb  (  aekb) 

D  aeke  (  acke) 

D  states  (states) 

D  stateb  (stateb) 

D  statec  (statec) 

D  noin  ( -inO  -ini) 

D  noacka  ( -aeka) 

D  noackb  (  -aekb) 

D  noacke  (  -acke) 

N  nfal 
Fi 

I  init;  st2  st3  st4  stS  st$  st9  stll 
S  st2  noin 

T  st2  st3  st4  st5  st6  st9  stll 
S  st3  badin 

T  st2  st3  st4  stS  st6  st8  st9  stll  - 

S  st4  acka 

T  st8 

S  stS  aekb 

Tst8 

S  stB  acke 

T  st8 

S  st7  O  stateb 
S  st8  O  error 
S  st9  inO 
TstlO 

S  stlO  O  states 
S  stll  ini 
T  st7 

S  stl2  O  statec 
S  stl3  states  X 

T  stS  st6  st 14  stlS  stl7  stl8  stl9  st22 
S  stl4  noacka 

T  stS  st6  stl4  stl5  stl7  stl8  stl9  st22 
S  stl5  O  outa 
S  stl6  badin 

T  st-*  stS  stS  st8  st9  stll  stl6  st29 
S  stl7  badin 
T  st8 
S  stl8  inO 
T  st8 
S  stl9  int 
T  st8 

S  st20  badin 

T  st 4  st5  stS  stl7  st20  st27  st31  st33 
S  st21  statec  X 

T  st4  stS  st 1 7  sl!8  stl9  st25  st28  st37 
S  sl22  acka 

T  sl4  stS  sl6  slt7  sl23  sl24  st30  st32 
S  st23  noin 

T  sl4  stS  stS  stl7  st23  st24  sl30  st32 
S  st24  badin 

T  sl4  stS  stS  st!7  st23  st24  st30  stS3 


Fig.  2(m).  Beginning  of  NFA  description. 
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S  (125  ackc 

T  (14  st5  (t6  at<  (tit  it  16  (t2t 
S  st26  noacke 

T  st4  itS  stl?  (tl8  (tit  it2S  st26  (tS 7 
S  it27  noin 

T  (t4  atS  (t6  itl7  (c20  (t27  (tSl  itSS 
S  at28  ackb 

T  at4  atS  at6  (tl7  it20  at27  (tSl  itSS 
S  at29  noin 

T(t4  atS  at6  itt  (tit  itll  at2t 

S  at30  inO 

Tit7 

S  it31  InO 

Titl2 

S  it32  ini 

T  stl2 

S  st33  ini 

T  stlO 

S  st34  atateb  X 

T  st4  sl6  atl7  stl8  stlt  at28  itSS  stSt 
S  st 35  nosckb 

T  st4  st6  it  17  stl8  atlt  at28  itSS  at36 
S  at36  O  outb 
S  st37  O  oute 

C  st2;  st3  st4  st5  sl6  st8  stt  it  11 
C  sl3;  st4  st5  st6  st8  stt  stll 

C  st4;  sl5  st6  st8  stt  stll  stl6  itl7  stl8  (tit  «t20  st23  st24  st25  st2S  (t27  at28  at2t  (ISO  sl31  it32  it33 
st35  st36  st37 

C  st5;  st6  st8  stt  stll  stl4  itlS  stlS  (tl7  stlt  atlt  st20  st22  (123  st24  st25  st26  st27  st2t  st30  st31 
st32  st33  st37 

C  stC;  st8  stt  stll  stl4  stlS  stl6  (tl7  stl8  stlt  st20  st22  St23  st24  st27  st28  st29  st30  st31  st32  st33  st35  st36 
C  it7;  st8 

C  st8;  stt  stlO  stll  atl2  stU  stlS  stll  stl7  stll  stlt  st20  st22  st23  st24  st2S  st26  st27  st28  st2t 
st30  it 31  sl32  st33  st35  itSS  st37 
C  stt;  stll  stl6  st2t 
C  stll;  stlS  st2t 
C  stl3;  it21  st34 
C  stl4;  stl 5  stl7  st!8  atlt  st22 
C  stlS;  stl7  stl8  stlt  st22 
C  stl6;  st21 

C  stl7;  a.18  atlt  st20  st22  st23  at24  st2S  (t26  st27  st28  it 30  st31  st32  stSS  st3S  st36  st37 

C  stl 8;  stlt  st22  st25  st26  st28  itSS  st36  st37 

C  stlt;  st22  sL25  st2G  st28  stSS  st38  st37 

C  st20;  st27  st31  st33 

Cit21;it34 

C  st23;  st24  stSO  st32 

C  st24;  st30  st32 

C  at2S;  st26  st37 

C  st26;  st37 

C  st27;  st31  st33 

C  at28;  st35  st36 

C  sl30;  st32 

0  st3 1 ;  *133 

C  st35;  st36 

E 


i 


Fig.  2(b).  End  of  NPA  description. 


several  states  with  the  same  predecessors  to  simulate  one  state  with  transitions  on  several  inputs. f 

An  0  preceding  the  symbol  associated  with  a  state  means  that  the  symbol  is  an  output  symbol,  rather 
than  an  input  symbol.  The  letter  X  following  the  symbol,  as  in  «tl3,  means  that  the  state  is  external;  it  is 
always  on  and  waiting  to  see  its  input  symbol. $  It  is  exactly  the  states  of  the  NFA  that  represent  the  states 
used  in  the  regular  expression  specification  that  become  external  states  of  the  NFA. 

All  states  are  followed  by  the  letter  T  and  a  list  of  their  transitions,  that  is,  their  successors.  Finally 
there  are  conflict  statements  introduced  by  the  letter  G.  The  state  following  the  C  is  declared  to  conflict  with 
all  the  states  after  the  semicolon.  Conflicting  states  are  those  that  can  be  on  at  the  same  time,  a  result  not 
only  of  the  nondeterminism  but  of  the  fact  that  several  input  symbols  may  be  recognizable  at  once.  Conflicts 
among  states  are  taken  into  account  when  we  find  a  coding  of  the  NFA’s  states  for  an  implementation.  □ 


m.  Logic  Generation 


The  NFA  is  converted  to  the  logic  language  Igen  by  an  algorithm  described  in  [U].  Briefly,  the  nondeter- 
ministic  states  must  receive  representations  that  will  enable  us  to  identify  that  each  state  is  on,  regardless  of 
what  other  states  are  also  on  at  the  same  time.  Here  is  where  knowing  the  state  conflicts  may  help,  because 
when  state  i  is  on,  that  fact  can  only  be  obscured  by  states  that  conflict  with  t  also  being  on.  For  example, 
if  the  NFA  were  really  deterministic,  there  would  be  no  conflicts,  and  we  could  use  a  binary  coding  of  the 
states. 

One  way  to  code  states  is  to  give  each  a  private  signal.  Then  we  can  tell  the  state  is  on  independent  of 

any  other  states.  The  actual  approach  taken  by  the  logic  generator  used  is  slightly  more  sophisticated.  It 

attempts  to  identify  groups,  which  are  sets  of  mutually  nonconflicting  states.ff  Within  a  group,  binary  codes 

arc  selected  so  that  any  conflicting  states  from  other  groups  will  receive  the  same  code.  To  do  so,  a  minimal 

number  of  states  that  would  make  this  coding  impossible  are  expelled  from  groups  and  given  private  signals. 

Groups  of  a  single  state  are  similarly  given  private  signals. 

The  result  is  that  in  addition  to  private  signals,  there  are  code  bits  and  group  bits.  A  state  without  a 

private  signal  is  recognized  by  the  bit  of  its  group  being  on,  and  the  code  bits  being  on  or  ofT  as  appropriate 

t  [T]  describes  a  more  general  NFA  language  that  allows,  multiple  transitions,  (-transitions,  and  a  variety  of  options  not 
available  in  the  NKA  language  described  here. 

!  There  is  another  kind  of  slate  like  external  states,  that  docs  not  appear  in  Kig.  2.  These  stales,  called  advance  states  and 
designated  by  A,  arc  like  external  stales,  but  when  they  sec  their  input,  they  enable  their  successors  to  recognize  their  own 
inputs  at  the  same  lime  unit.  Advance  states  are  needed  to  implement  correctly  networks  of  NFA’s  that  together  recognise 
one  large  regular  expression.  Large  expressions  need  to  be  broken  into  pieces  implemented  by  separate  NKA’s  for  two  reasons. 
First,  processing  large  expressions  is  loo  time  consuming,  especially  minimising  the  stales  of  the  NFA  and  computing  conflicts. 
Second,  the  circuits  implementing  the  NKA’s  such  as  PLA’s  or  Weinberger  arrays,  will  be  too  large  and  badly  shaped  if  the 
NFA  has  too  many  states. 

ft  However,  before  looking  at  conflicts,  states  that  have  exactly  the  same  predecessors  (and  therefore  arc  really  just  different 
transitions  from  the  ’‘same’’  state)  are  combined  into  one. 
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o*l2  =  -'ini  •  -'inO  *  eat2 
ost3  >=  ini  •  inO  *  eal2 
oeti  mm  acka  •  eitl 
ost5  =*  ackb  •  at! 
ostS  =  ackc  *  eatS 
oat9  =  — <in  1  •  inO  *  eat! 
oetll  —  ini  *  ->inO  *  eatS 
oat  13  eta  tea 

oatM  =  -acka  *  eel  •  ->ce2  *  -<ce3 
oatl7  =  ini  *  inO  •  estl7 
ostlS  =  ->inl  *  inO  •  eatl8 
ostlS  =  ini  *  -'inO  *  eatl8 
ost22  =  acka  *  eel  *  ->ce2  *  -ieeS 
ostl6  =  ini  *  inO  »  ->ccl  *  ee2  *  -tceS 
osl29  =  -•ini  *  -iinO  *  --cel  •  ee2  a  -et] 
ost20  =  ini  *  inO  *  eel  *  ce2  *  -<ceS 
ost27  =  — «in  1  »  ->inO  *  cel  *  ce2  *  --cej 

ost31  =  -"ini  »  inO  *  eel  *  ce2  *  ->eeS  ■ 

oat33  =  ini  *  -iinO  »  eel  *  ee2  *  ->ee3 

ost21  =  atatee 

ost2S  —  ackc  *  -.cel  •  ->ce2  *  eeS 
osl26  =  ->ackc  »  -•cel  •  -»ce2  *  ce3 
ost23  =  -'ini  *  ->inO  *  eel  •  ->ce2  •  caS 
osl24  =  ini  *  inO  *  eel  *  ->ce2  *  eeS 
ost30  —  -'ini  *  inO  *  eel  *  ->ce2  *  ce3 

ost32  =  ini  *  -'inO  *  eel  *  ->ce2  •  ee3  - 

ost28  —  ackb  *  --cel  *  ce2  *  ce3 

ost34  =  stateb 

ost35  =  --ackb  •  --cel  *  ce2  •  ce3 

est2  =  LAST  fst2  CLEAR  globalinit  4-  inlt 

fst2  =  ost2  4-  ost3 

cst4  =  LAST  fst4  CLEAR  globalinit  +  init 

f»t4  =  oat2  +  os  13  4-  oal22  +  oat  16  +  oatSfl  +  ost20  +  ost27  +  oat21  +  oat25  4-  oat26  4-  oat23  +  oat24  4- 
oat28  4-  oat34  4-  ostSS 
eatS  =  LAST  fats  CLEAR  globalinit  +  init 

TatS  —  ost2  -f  ost3  +  oat  13  4-  oatl4  4-  oat22  4-  oatlS  +  oat28  +  oat20  4-  oat27  4-  oat21  4-  oat25  4-  oat26  4- 
oet23  4-  oat24  4-  oat28 
eal6  *  LAST  IstS  CLEAR  globalinit  4-  Init 

fat6  =  ost2  4-  ost3  4-  oatlS  4-  oetl4  4-  oat22  +  oat  16  4-  oat28  4-  oat20  4-  oat27  4-  oet25  4-  ost23  4-  oat24  4- 
oat28  4-  oat34  4-  oet35 
eatS  ■=  LAST  fatS  CLEAR  globalinit  4-  init 
fatS  *=*  ost2  4-  oat3  4-  o*tl6  4-  oat2S  4-  oat25 
eat  17  =-  LAST  Tatl7  CLEAR  globalinit 

fatl7  =  ostlS  4-  oatl4  4-  ost22  4-  oat20  4-  oat27  4-  oat21  4-  oat26  4-  oat2S  4-  oat24  4-  oat28  4-  ost34  4-  oatSS 

eat  18  —  LAST  fatl8  CLEAR  globalinit 

fstl8  ”=  oatlS  4-  oatl4  4-  oat21  4-'  oat26  4-  oat34  4-  oatSS 

error  ~  oat3  4-  oat4  4-  oatS  +  oatS  4-  oetl7  4-  oatlS  4-  oatlS  4-  oatlS 

atateb  =  oat  11  4-  oat30 

statea  so  ostS  4-  oat33 

atalcc  =  ost31  4-  oat32 

outa  «*  oat  13  4-  oat 1 4 

outc  =  oal21  4-  oat26 

ontb  =  osl34  4-  oat35 

eel  =  LAST  cCl  CLEAR  globalinit 

cTl  >»  oat  13  4-  oatl4  4-  oat22  4-  ost20  4-  oat27  4-  oal23  4-  oat24  4-  oatSS 
cc2  =  LAST  ef2  CLEAR  globalinit 

ef2  «=  oatl  6  4-  oat29  4-  ost20  4-  oat27  4-  oat25  4-  oat28  4-  oatS 4  4-  oatSS 
ec3  —  LAST  cR  CLEAR  globalinit 

cf3  a*  oal22  4-  ost21  4-  ost26  4-  ost23  +  oat24  4-  oat34  4-  oatSS 


Fig.  3.  Logic  implementing  communication  protocol. 
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to  its  code. 
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Example  3:  In  Fig.  3  we  see  the  output  of  the  NFA-to-Iogic  compiler,  with  certain  header  information, 
indicating  clocking  and  the  borders  on  which  signals  appear,  omitted.  Because  it  turns  out  that  there  is  only 
one  nontrivial  group,  and  that  group  does  not  have  exactly  a  power  of  two  states,  we  were  able  to  eliminate 
the  group  bit,  and,  by  not  using  the  all-zeros  code  for  any  state  in  the  group,  detect  the  presence  of  a  state 
in  the  group  by  one  or  more  of  the  code  bits  being  on. 

The  overall  organization  of  the  logic  in  Fig.  3  is  not  unlike  that  of  a  PLA.  The  variables  arc  in  three 
groups,  designated  by  the  letters  e,  f,  and  o.  The  first  group,  e,  corresponds  to  columns  in  the  and-planc  of 
a  PLA  and  represents  the  fact  that  a  certain  state  is  “enabled”;  if  its  input  symbol  is  now  seen  it  can  enable 
its  successors  for  the  next  input  cycle.  Some  states  have  private  enablers,  like  est2  for  state  2.  Other  states 
arc  coded,  and  in  Fig.  3  there  arc  three  coded  enabler  variables  cel,  ce2,  and  ce3,  combinations  of  which 
represent  the  enablers  for  various  states.  Note  that  not  every  state  has  an  enabler,  cither  private  or  coded. 
States  without  enablers  have  the  same  entering  transitions  as  some  other  state  that  does  have  an  enabler, 
and  the  same  enabler  serves  for  both. 

The  f  group  of  variables  are  “feedback”;  they  correspond  to  columns  in  the  or-plane.  State  fX  at  one 
time  unit  becomes  eX  at  the  next  time  unit  by  means  of  Igen  statements  such  as 
eat2  =  LAST  /at 2  CLEAR  globalinit  +  init 

which  says  that  state  2  is  enabled  cither  by  the  initial  signal  init,  or  by  fst2  being  on  at  the  previous  time 
unit.  The  output  signals,  such  as  atatea  or  OUT  A,  also  correspond  to  columns  of  the  or-plane. 

The  o  group  of  variables  correspond  to  the  terms  of  the  PLA.  For  each  state  there  is  an  o  variable  that 
is  turned  on  when  the  state  is  enabled,  and  the  proper  input  is  seen.  For  example,  line  1  of  Fig.  3  says  oat 2 
is  on  whenever  state  2  is  enabled  (eat2  is  on)  and  noin  is  seen  on  the  input  (detected  by  both  wires  mO 
and  ini  being  off).  Line  9  says  that  ostl4  is  turned  on  when  input  noacka  is  seen  and  stale  14  is  enabled 
(represented  by  the  coded  enabler  bits  being  100). 

The  only  difference  between  a  PLA  structure  and  the  organization  of  the  variables  in  Fig.  3  is  that  the 
atatea,  stateb  and  statec  variables  do  not  (it  into  the  scheme.  Rather,  we  can  view  them  as  fed  back  from  the 
or-plane,  where  they  arc  generated,  to  the  and-planc,  where  they  arc  used,  with  no  delay  due  to  clocking. 
Thus,  Fig.  3  can  be  used  almost  directly  as  input  to  a  PLA  generator  that  permits  undocked  signals  as  an 
option. 
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IV.  Evaluation  of  Results 


It  is  difficult  to  compare  the  logic  of  Fig.  3  with  the  “best  possible”  logical  description  of  an  equivalent 
circuit.  It  appears  that,  when  the  ability  of  the  Igen  compiler  to  eliminate  common  subexpressions  and 
perform  other  optimizations  is  taken  into  account,  the  resulting  logic  will  be  very  close  to  that  found  in  the 
hand  designed  PLA  described  below.  Thus,  we  are  optimistic  that  our  automatic  synthesis  method  behaves 
very  well  when  amount  of  logic  generated  is  the  criterion  used. 

We  can  obtain  a  more  concrete  estimate  of  the  quality  of  the  circuit  designed  if  we  view  it  as  a  PLA 
specification  and  compare  it  with  a  PLA  designed  carefully  by  hand.  In  our  hand  design,  we  used  three 
feedback  wires.  Two  were  used  to  binary  code  the  “state,”  i.e.,  whether  we  were  in  the  start  condition,  or 
in  what  we  have  called  states  a,  b,  and  c.  The  third  feedback  wire  indicated  whether  we  were  waiting  for  an 
acknowlcgcment  or  had  received  the  acknowledgement  and  were  waiting  for  the  next  input.  Terms  based  on 
this  encoding  were  written  down  and  optimized  using  the  gry  PLA  optimizer  [If].  The  resulting  PLA  had: 

1.  32  terms. 

2.  17  columns  in  the  and-plane,  representing  an  initializing  signal,  the  three  feedback  wires,  and  the  five 
input  wires  (t'nO,  ml,  acka,  ackb,  and  ackc),  each  of  which  except  the  initializer  requires  inversion. 

3.  7  columns  in  the  or-plane,  representing  the  three  feedback  wires  and  four  outputs  (ERROR,  OUTA, 
OUTB,  and  OUTC). 

The  resulting  area  is  32  *  (17  +  7)  =  788. 

In  comparison,  the  PLA  constructed  directly  from  Fig.  3  requires  the  following: 

1.  30  terms  (the  o  variables  plus  one  term  to  carry  the  initial  signal  to  the  or- plane). f 

2.  27  columns  in  the  and-plane,  consisting  of 

a)  10  columns  for  the  inputs  and  their  complements. 

b)  7  columns  for  the  private  state  enablers;  these  do  not  have  to  be  inverted. 

c)  3  columns  for  statea,  atateb,  and  statec;  these  also  do  not  require  inversion. 

d)  6  columns  for  the  three  coded  enablers,  which  do  require  inversion. 

e)  1  column  for  the  initial  signal.  ' 

3.  17  columns  in  the  or-plane,  consisting  of  four  output  signals  and  13  feedback  wires. 

The  resulting  size  is  30  *  (27  +  17)  =  1320.  This  figure  is  72%  greater  than  the  hand-designed  one.  The 

overhead  of  the  PLA  borders  will  tend  to  reduce  this  figure  somewhat,  as  will  the  fact  that  clocking  is  not 

needed  on  six  of  the  columns  of  the  machine-generated  PLA.  But  the  fact  that  space  is  required  for  13 

f  It  is  not  unusual  for  PLA’s  generated  from  regular  expressions  to  have  fewer  terms  than  hand-designed  ones,  because  the 
former  Pl.A’s  tend  to  use  one-hot  codes  (private  enablers)  for  slates,  and  that  sort  of  code  costs  columns,  but  may  save  terms. 


symbol  inO{x[l]  -x[2j) 
inl(x|2j  -xjlj) 
badin(x(l]  x[2j) 

*ck(x[3j) 
atstcia(x(4]  x[5]) 
stalcib(x[4]  -x[5|) 
stateic(-x[-1)  x(5]) 
start(-x[4)  -x[S]) 
noin(-x(l]  -x[2)) 
noack(-x[3]) 
output  OUTA 
OUTB 
OUTC 
stateoa 
stateob 
statcoc 
ERROR 

subcxp  somein=inO  +  ini  +  badin 
subcxf  waitin=noin  +  badin 
subexp  allbul01=ack  +  badin 
> 

start  waitin*  ( 

allbutoi  ERROR  + 
inO  stateoa  + 
ini  stateob 
) 

+ 

stateia  noack*  OUTA  ( 

somcin  ERROR  + 
ack  waitin*  ( 

allbutOl  ERROR  -f 
inO  stateob  + 
ini  stateoe 
) 

)  . 

+ 

stateib  noack*  OUTB  ( 

somcin  ERROR  + 
ack  waitin*  ( 

allbutOl  ERROR  + 
inO  stateoe  + 
ini  stateoa 
) 

) 

+ 

stateic  noack*  OUTC  ( 

somcin  ERROR  + 
ack  waitin*  ( 

allbutOl  ERROR  + 
inO  stateoa  + 
in  1  stateob 
) 

) 


Fig.  4.  Revised  input  to  regular  expression  compiler. 


feedback  wires  for  the  machine- generated  PLA  will  serve  to  increase  the  ratio. 


V.  Correction  of  Errors 

One  important  advantage  of  the  regular  expression  approach  to  design,  as  with  high-level  descriptions  in 
general,  is  that  modifications  arc  easier  to  make,  and  make  reliably,  than  with  ad-hoc  designs. 

Example  4:  It  turns  out  that  our  design  of  Pig.  1  is  not  the  simplest  that  meets  the  specifications  of  [AUY]. 
Rather,  since  the  channel  is  assumed  never  to  make  a  mutation  error,  only  to  lose  signals,  there  is  no  need 
to  distinguish  between  the  three  acknowledgements;  whenever  we  receive  an  acknowledgement,  we  know  it 
was  actually  sent  by  the  receiver,  and  we  know  that  if  the  receiver  is  not  broken,  then  it  was  sent  in  response 
to  the  receipt  of  the  correct  signal. 

Thus,  we  should  modify  Fig.  1  in  the  following  two  ways. 

1.  acka,  ackb,  and  ackc  should  be  identified  as  the  signal  ack;  similarly,  their  complements,  noacka,  etc., 
are  identified  as  noack. 

2.  While  waiting  for  an  acknowledgement,  there  are  no  “wrong  acknowledgements”  to  receive,  so  terms 
like 

(acfcb  +  ackc  +  somein)  ERROR 

should  be  replaced  by 

somein  ERROR 

The  resulting  revision  is  shown  in  Fig.  4. 

VI.  Conclusions 

Regular  expressions,  in  conjunction  with  conventional  state  machine  definitions  of  processes,  is  a  promis¬ 
ing  way  to  design  circuits  at  a  very  high  level.  The  use  of  optimizing  logic  compilers  to  do  the  actual 
implementation  may  be  superior  to  PLA  implementation  of  regular  expressions,  but  the  evidence  of  the  case 
described  in  this  paper,  and  other  cases  we  have  analyzed  by  hand,  is  that  even  PLA  implementation  of 
regular  expressions  comes  within  a  factor  of  two  of  the  area  used  by  hand-designed  PLA’s.  Further,  the 
problem  of  coding  nondeterministic  states  is  not  yet  fully  resolved,  and  there  is  hope  that  better  PLA  and/or 
logic  implementations  of  regular  expressions  will  be  developed  in  the  future. 
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